首页> 外文OA文献 >Computer vision systems for automatic analysis of face and eye images in specific applications of interpretation of facial expressions
【2h】

Computer vision systems for automatic analysis of face and eye images in specific applications of interpretation of facial expressions

机译:在面部表情解释的特定应用中用于自动分析面部和眼睛图像的计算机视觉系统

摘要

This thesis is about the computer vision-based automation of specific tasks of face perception, for specific applications where they are essential. These tasks, and the applications in which they are automated, deal with the interpretation of facial expressions.Our first application of interest is the automatic recognition of sign language, as carried out via a chain of automatic systems that extract visual communication cues from the image of a signer, transcribe these visual cues to an intermediary semantic notation, and translate this semantic notation to a comprehensible text in a spoken language. For use within the visual cue extraction part of such a system chain, we propose a computer vision system that automatically extracts facial communication cues from the image of a signer, based on a pre-existing facial landmark point tracking method and its various robust refinements. With this system, our contribution notably lies in the fruitful use of this tracking method and its refinements within a sign language recognition system chain. We consider the facial communication cues extracted by our system as facial expressions with a specific interpretation useful to this application.Our second application of interest is the objective assessment of visual pursuit in patients with a disorder of consciousness. In the clinical practice, this delicate assessment is done by a clinician who manually moves a handheld mirror in front of the patient's face while simultaneously estimating the patient's ability to track this visual stimulus. This clinical setup is appropriate, but the assessment outcome was shown to be sensitive to the clinician's subjectivity. For use with a head-mounted device, we propose a computer vision system that attaches itself to the clinical procedure without disrupting it, and automatically estimates, in an objective way, the patient's ability to perform visual pursuit. Our system, combined with the use of a head-mounted device, therefore takes the form of an assisting technology for the clinician. It is based on the tracking of the patient's pupil and the mirror moved by the clinician, and the comparison of the obtained trajectories. All methods used within our system are simple yet specific instantiations of general methods, for the objective assessment of visual pursuit. We consider the visual pursuit ability extracted by our system as a facial expression with a specific interpretation useful to this application.To some extent, our third application of interest is the general-purpose automatic recognition of facial expression codes in a muscle-based taxonomic coding system. We do not actually provide any new computer vision system for this application. Instead, we consider a supervised classification problem relevant to this application, and we empirically compare the performance of two general classification approaches for solving this problem, namely hierarchical classification and standard classification ("flat" classification, in this comparative context). We also compare these approaches for solving a classification problem relevant to 3D shape recognition, as well as artificial classification problems we generate in a simulation framework of our design. Our contribution lies in the general theoretical conclusions we reach from our empirical study of hierarchical vs. flat classification, which are of interest for properly using hierarchical classification in vision-based recognition problems, for example for an application of facial expression recognition.
机译:本论文是关于基于计算机视觉的面部感知特定任务的自动化,适用于必不可少的特定应用。这些任务以及其中的自动化应用程序处理面部表情的解释。我们感兴趣的第一个应用程序是自动识别手语,这是通过从图像中提取视觉传达线索的一系列自动系统执行的签名者的身份,将这些视觉提示转录为中间语义符号,然后将此语义符号转换为口语中的可理解文本。为了在这样的系统链的视觉提示提取部分中使用,我们提出了一种计算机视觉系统,该系统基于预先存在的面部界标点跟踪方法及其各种强大的改进功能,从签名者的图像中自动提取面部通讯提示。有了这个系统,我们的贡献尤其在于该跟踪方法及其在手语识别系统链中的改进的卓有成效的使用。我们将系统提取的面部通讯提示视为具有特定解释​​的面部表情,对此应用程序很有用。我们感兴趣的第二个应用程序是对意识障碍患者的视觉追求进行客观评估。在临床实践中,这种微妙的评估是由临床医生完成的,他手动将手持镜子移到患者面部的前面,同时估计患者跟踪该视觉刺激的能力。这种临床设置是适当的,但是评估结果显示对临床医生的主观性敏感。为了与头戴式设备配合使用,我们提出了一种计算机视觉系统,该系统可将其自身附加到临床过程中而不会中断它,并以客观的方式自动估算患者进行视觉追踪的能力。因此,我们的系统与头戴式设备的结合使用,为临床医生提供了一种辅助技术。它基于对患者瞳孔和临床医生移动的镜子的跟踪,以及所获得轨迹的比较。我们系统中使用的所有方法都是通用方法的简单但特定的实例,用于客观地评估视觉追求。我们将系统提取的视觉追踪能力视为具有特定解释​​的面部表情,对此应用程序很有用。在某种程度上,我们的第三个应用是基于肌肉的分类编码中的面部表情代码的通用自动识别系统。我们实际上并未为此应用程序提供任何新的计算机视觉系统。取而代之的是,我们考虑与该应用程序相关的监督分类问题,并根据经验比较两种通用分类方法在解决该问题上的性能,即分层分类和标准分类(在这种比较情况下为“扁平”分类)。我们还比较了这些方法来解决与3D形状识别相关的分类问题,以及我们在设计仿真框架中生成的人工分类问题。我们的贡献在于,通过对分层与平面分类的经验研究得出的一般理论结论,对于在基于视觉的识别问题中正确使用分层分类(例如在面部表情识别的应用中)是有意义的。

著录项

  • 作者

    Hoyoux, Thomas;

  • 作者单位
  • 年度 2017
  • 总页数
  • 原文格式 PDF
  • 正文语种 en
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号